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SYNTHESIS OF SPATIALLY ADDRESSED MOLECULAR ARRAYS 
Field of the Invention 

This invention relates to fabricated arrays of polymers. In particular, this 
invention relates to the production of spatially addressed polymer arrays. 
5 Background of the Invention 

Advances in the study of molecules have been led, in part, by 
improvement in technologies used to characterise the molecules or their 
biological reactions. In particular, the study of nucleic acid, DNA and RNA, has 
benefitted from developing technologies used for sequence analysis and the 
10 study of hybridisation events. 

An example of the technologies that have improved the study of nucleic 
acids, is the development of fabricated arrays of immobilised nucleic acids. 
These arrays typically consist of a high-density matrix of polynucleotides 
immobilised onto a solid support material. Fodor etai, Trends in Biotechnology 
15 (1994) 12:19-26, describes ways of assembling the nucleic acid arrays using 
a chemically sensitised glass surface protected by a mask, but exposed at 
defined areas to allow attachment of suitably modified nucleotides. 

An alternative approach is described by Schena et a/., Science (1995) 
270:467-470, where samples of DNA are positioned at predetermined sites on 
20 a glass microscope slide by robotic micropipetting techniques. The DNA is 
attached to the glass surface through its entire length by non-covalent 
electrostatic interactions. 

The arrays are usually provided to study hybridisation events, determine 
the sequence of DNA (Mirzabekov, Trends in Biotechnology (1994) 12:27-32) 
25 or to detect mutations in a particular DNA sample. Many of these hybridisation 
events are detected using fluorescent labels attached to nucleotides with 
fluorescence detected using sensitive fluorescent detector, e.g. charge 
coupled detector (CCD). However, the major disadvantages of these methods 
are that it is not possible to sequence long stretches of DNA and repeat 
30 sequences can lead to ambiguity in the results. These problems are 
recognised in Automation Technologies for Genome Characterisation, Wiley- 
Interscience, 1997, Ed. T. J. Beugelsdijk, Chapter 10: 205-225. 
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In addition, the use of multi-molecule high-density arrays in a multi-step 
analysis procedure can lead to problems with phasing. Phasing problems 
result from a loss in the synchronisation of a reaction step occurring on different 
molecules of the array. If a proportion of the arrayed molecules fails to undergo 
5 a step in the procedure, subsequent results obtained for these molecules will 
no longer be in-step with results obtained for the other arrayed molecules. The 
proportion of molecules out of phase will increase through successive steps 
and consequently the results detected will become ambiguous. This problem 
is recognised in the sequencing procedure described in US-A-5302509. 
10 Summary of the Invention 

According to the present invention, a method for forming a spatially 
addressable array of polymers immobilised on a solid support comprises the 
steps of: 

(i) contacting an array of single molecules with one or more 
15 detectably labelled monomers, under conditions that permit 

incorporation of a monomer onto a molecule of the array, wherein 
the labelled monomer comprises a removable blocking group that 
prevents further monomer incorporation occurring; 

(ii) removing non-incorporated monomers and detecting the label on 
20 the incorporated monomer; 

(iii) removing the blocking group and any separate label; and 

(iv) optionally repeating steps (i) - (iii) to form a single polymer of 
defined sequence; 

wherein the array has a surface density which allows each polymer to be 

25 individually resolved by optical microscopy. 

According to the present invention, high-density single polymer arrays 
are synthesised in a manner that permits the sequence of each polymer to be 
determined. As the sequence for each polymer is known, the result of the 
synthesis is a spatially addressed array. Further, the random addition of 

3 o monomers to the growing polymer strands in the synthesis procedure allows a 
vast diversity of different polymers to be formed. 
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The formation of spatially addressed high-density arrays has many 
important benefits for the study of the single polymer molecules and their 
interactions with other biological molecules. The arrays are particularly suitable 
for DNA analysis procedures using hybridisation-based approaches. Knowing 
5 the sequence of polynucleotides (polymers) on the array enables the user to 
quickly determine the sequence of a complementary polynucleotide hybridised 
thereto. 

Description of the Invention 

The present invention relates to the formation of single molecule polymer 

10 arrays using a step-wise synthesis procedure, whereby the identity of each 
monomer is determined at each incorporation step. 

The term "single molecule" and "single polymer" is used herein to 
distinguish from high-density, multi-molecule arrays in the prior art, which may 
comprise distinct clusters of many molecules of the same type. 

15 The term "individually resolved" is used herein to indicate that, when 

visualised, it is possible to distinguish one polymer on the array from its 
neighbouring polymers. Visualisation may be effected by the use of reporter 
labels, e.g. fluorophores, the signal of which is individually resolved. The 
requirement for individual resolution ensures that individual monomer 

20 incorporation can be detected at each synthesis step. 

In general, the method may be carried out using conventional synthesis 
techniques which utilise the step-wise incorporation of monomers onto a 
growing polymer strand. 

The synthesised polymers may be of any biomolecule or organic 

25 molecule, including peptides and polypeptides. The polymers are preferably 
polynucleotides, e.g. DNA or RNA, and the monomers for incorporation may be 
the bases adenine (A), thymine (T), guanine (G) and cytidine (C). Uracil (U) 
may also be used. 

The monomers should be detectably-labeled and include a blocking 

3 o group to prevent incorporation of further monomers until after the detection step 
has been carried out. In one preferred embodiment, the label is, or is part of, 
the blocking group, and can be removed under defined conditions. Different 
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monomer types will usually be labeled with a distinct label. For example, in the 
context of DNA synthesis, each monomer base will have a specific label which 
characterises the base. This enables the stepwise incorporation of monomers 
to be monitored during the synthesis procedure. 
5 Preparation of monomers with suitable labels and blocking groups will 

be apparent to the skilled person. For DNA, conventional phosphoramidite 
chemistries may be used. The label (fluorophore) may be located on a 
protecting group or may be located at a separate position. A skilled person will 
appreciate that cleavable linker groups can be readily prepared, as in 

10 US-A-5302509. 

Suitable labels will also be apparent to the skilled person. In a preferred 
embodiment, the label is a fluorophore. Alternative labels may be used. A 
number of strategies for labelling molecules of DNA have been reported, such 
as microspheres (Anal. Chem. (2000) 72 ( 15: 3678-3681), gold nanoparticles 

15 (J. Am. Chem. Soc, (2000) 122, 15: 3795-3796), silver colloid particles (PNAS, 
(2000) 97, 3: 996-1 001 ) and quantum dots. Any labelling technique that allows 
unambiguous identification of the incorporated moiety can be utilised in this 
scheme. 

The first step in the synthesis procedure will be to form an array of single 
20 molecules, onto which the monomers are to be incorporated. Immobilisation of 
the single molecules to the surface of a solid support may be carried out by any 
known technique. Generally the array is produced by dispensing small volumes 
of a sample onto a suitably prepared solid surface, or by applying a dilute 
solution to the solid surface to generate a random array. Immobilisation may 
25 occur by covalent or non-covalent interactions. 

The single molecules may themselves be monomers, prepared so that 
immobilisation with the solid support can occur. If the molecule is a monomer 
base, immobilisation will preferably occur at the 3'-position to permit 
incorporation at the 5'-position. Various linker molecules, e.g. polyethylene 
30 glycol, may also be present. Further details of the preparation of these single 
molecule arrays is disclosed in WO-A-00/06770. 
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If the polymer is a polynucleotide, synthesis may be carried out by the 
use of conventional soiid-phase DNA synthesis techniques, e.g. using 
phosphoramidite chemistry, as disclosed in "Nucleic Acids in Chemistry and 
Biology" by Blackburn & Gait, Oxford University Press, pages 118-137, 
5 Tetrahedron Letters (1 990) 31 49: 7095-7098, and Tetrahedron Letters (2000) 
56: 271 3-2724. If a fluorescently-modified 5-protecting group is used with the 
phosphoramidite, then the deprotection and removal of the fluorescent label 
can be carried out in a single step after each round of synthesis. Each round 
of synthesis may comprise one or more different monomers, e.g. the bases G, 

10 C, A and T. The array may be synthesised randomly by incorporating all the 
different monomers during each round of synthesis, or in a more controlled 
fashion, using only one distinct monomer in each round of synthesis. 

The density of the arrays is not critical. However, the present invention 
can make use of a high-density of single polymer molecules, and these are 

15 preferable. For example, arrays with a density of 1 0 6 -1 0 9 polymers per cm 2 may 
be used. Preferably, the density is at least 1 0 7 /cm 2 and typically up to 1 0 8 /cm 2 . 
These high-density arrays are in contrast to other arrays which may be 
described in the art as "high-density" but which are not necessarily as high 
and/or which do not allow single molecule resolution. 

20 The extent of separation between the individual polymers on the array 

will be determined, in part, by the particular technique used to resolve the 
individual polymer molecule. Apparatus used to image molecular arrays are 
known to those skilled in the art. For example, a confocal scanning microscope 
may be used to scan the surface of the array with a laser to image directly a 

25 fluorophore incorporated on the individual polymer by fluorescence. 
Alternatively, a sensitive 2-D detector, such as a charge-coupled detector, can 
be used to provide a 2-D image representing the individual polymers on the 
array. 

Resolving single polymer molecules on the array with a 2-D detector can 
30 be done if, at 100 x magnification, adjacent polymers are separated by a 
distance of approximately at least 250nm, preferably at least 300nm and more 
preferably at least 350nm. It will be appreciated that these distances are 
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dependent on magnification, and that other values can be determined 
accordingly, by one of ordinary skill in the art. 

Other techniques such as scanning near-field optical microscopy 
(SNOM) are available which are capable of greater optical resolution, thereby 
5 permitting more dense arrays to be used. For example, using SNOM, adjacent 
polymers may be separated by a distance of less than 100nm, e.g. 10nm. For 
a description of scanning near-field optical microscopy, see Moyer et ai , Laser 
Focus World (1993) 29(10). 

An additional technique that may be used is surface-specific total 
10 internal reflection fluorescence microscopy (TIRFM); see, for example, Vale et 
a/., Nature, (1996)380:451-453). Using this technique, it is possible to achieve 
wide-field imaging (up to 100 pm x 100 pm) with single polymer molecule 
sensitivity. This may allow arrays of greater than 1 0 7 resolvable polymers per 
cm 2 to be used. 

is Additionally, the techniques of scanning tunnelling microscopy (Binnig 

ef a/., Helvetica Physica Acta (1982) 55:726-735) and atomic force microscopy 
(Hansma ef a/., Ann. Rev. Biophys. Biomol. Struct. (1994) 23:115-139) are 
suitable for imaging the arrays of the present invention. Other devices which 
do not rely on microscopy may also be used, provided that they are capable of 

20 imaging within discrete areas on a solid support. 

Suitable solid supports are available commercially, and will be apparent 
to the skilled person. The supports may be manufactured from materials such 
as glass, ceramics, silica and silicon. The supports usually comprise a flat 
(planar) surface, or at least an array in which the polymers are in the same 

25 plane. Any suitable size may be used. For example, the supports might be of 
the order of 1-10 cm in each direction. 

It is important to prepare the solid support under conditions which 
minimise or avoid the presence of contaminants. The solid support must be 
cleaned thoroughly, preferably with a suitable detergent, e.g. Decon-90, to 

3 o remove dust and other contaminants. 

Because the array consists of optically resolvable polymers, the 
synthesis of each target polymer will generate a series of distinct signals as the 
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fluorescent events are detected. Details of the full sequence may then be 
determined. 

The sequence of the polymers is determined by the random 
incorporation of the monomers and not by the presence of any template 
5 molecule. Sequencing procedures are therefore not required, i.e. procedures 
requiring the use of the polymerase enzyme. 

The arrays of the invention are particularly suitable for analysis 
procedures where the spatially addressable polymers can be used to reveal 
information on an interacting molecule. For example, if the polymers are 
i o polynucleotides, the arrays may be used in hybridisation-based procedures, to 
reveal the sequence of target DNA which hybridises on the array. Uses of 
spatially addressed arrays are disclosed in WO-A-00/06770. 
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CLAIMS 

1. A method for forming a spatially addressable array of polymers 
immobilised on a solid support, comprising the steps of: 

(i) contacting an array of single molecules with one or more 
5 detectably labelled monomers, under conditions that permit 

incorporation of a monomer onto a molecule of the array, wherein 
the labelled monomer comprises a removable blocking group that 
prevents further monomer incorporation occurring; 

(ii) removing non-incorporated monomers and detecting the label on 
io the incorporated monomer; 

(iii) removing the blocking group and any separate label; and 

(iv) optionally repeating steps (i) - (iii) to form a single polymer of 
defined sequence; 

wherein the array has a surface density which allows each polymer to be 
15 individually resolved by optical microscopy. 

2. A method according to claim 1 , wherein the polymer is a polynucleotide, 
and the monomers are any of the bases A, C, T and G. 

3. A method according to claim 2, wherein each of the bases A t C, T and 
G comprises a different label, and step (i) is carried out in the presence of all 

20 four bases. 

4. A method according to any preceding claim, wherein the label is a 
fluorophore. 

5. A method according to claim 4, wherein the label is detected using a 2-D 
fluorescent imaging device, a confocal fluorescence microscope or a CCD 

25 camera. 

6. A method according to claim 4 or claim 5, wherein the label is removed 
by photobleaching or by chemical or enzymatic cleavage. 

7. A method according to any preceding claim, wherein the array has a 
density of from 10 5 to 10 9 polymers per cm 2 . 

3 0 8. A method according to claim 9, wherein the density is 10 7 to 10 8 
polymers per cm 2 . 
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9. A method according to any preceding claim, wherein the polymers are 
separated by a distance of at least 100nm. 

10. A method according to claim 9, wherein the polymers are separated by 
a distance of at least 250nm. 
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THE PREPARATION OF POLYNUCLEOTIDE ARRAYS 

Field of the Invention 

This invention relates to fabricated arrays of polynucleotides, and to their analytical 

applications. 
5 Background of the Invention 

Advances iri-the study of molecules have been led, in part, by improvement in 
technologies used to characterise the molecules or their biological reactions. In particular, 
the study of nucleic acids, DNA and KNA, has benefited from developing technologies 
used for sequence analysis and the study of hybridisation events. 

10 An example of the technologies that have improved the study of nucleic acids, is 

the development of fabricated arrays of immobilised nucleic acids. These arrays typically 
consist of a high-density matrix of polynucleotides immobilised onto a solid support 
material. Fodor et aL, Trends in Biotechnology (1994) 12:19-26, describes ways of 
assembling the nucleic acid arrays using a chemically sensitised glass surface protected by 

1 5 a mask, but exposed at defined areas to allow attachment of suitably modified nucleotides. 
Typically, these arrays maybe described as "many molecule" arrays, as distinct regions are 
formed on the solid support comprising a high density of one specific type of 
polynucleotide. 

An alternative approach is described by Schena et al. y Science (1995) 270:467- 
20 470, where samples of DNA are positioned at predetermined sites on a glass microscope 
slide by robotic micropipetting techniques. The DNA is attached to the glass surface along 
its entire length by non-covalent electrostatic interactions. However, although 
hybridisation with complementary DNA sequences can occur, this approach may not 
permit the DNA to be freely available for interacting with other components such as 
25 polymerase enzymes, DNA-binding proteins etc. 

WO-A-96/27025 is a general disclosure of single molecule arrays. Although 
sequencing procedures are disclosed, there is little description of the applications to which 
the arrays can be applied. There is also only a general discussion on how to prepare the 
arrays. 

30 Summary of the Invention 

According to the present invention, a device comprises a high density array of 
relatively short molecules and relatively long polynucleotides, immobilised on the surface 
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of a solid support, wherein the polynucleotides are at a density that permits individual 
resolution of those parts that extend beyond the relatively short molecules. In this aspect, 
the shorter molecules help to control the density of the polynucleotides, providing a more 
uniform array of single polynucleotide molecules, thereby improving imaging. The small 
5 molecules may also prevent non-specific binding of reagents to the solid support, and 
therefore reduce background interference. For example, in the context of a polymerase 
reaction to incorporate nucleoside triphosphates onto a strand complementary to a long 
polynucleotide, the small molecules prevent the polymerase and nucleosides from attaching 
to the solid support surface, which may otherwise interfere with the imaging process. 
10 The shorter molecules may also ensure that each polynucleotide is maintained 

upright, preventing the polynucleotides from interacting lengthwise with the solid support, 
which may otherwise prevent efficient interaction with a reagent, e.g. a polymerase. This 
may also prevent the fluorophore being quenched by the surface and therefore lead to 
more accurate imaging of the single polynucleotides. 

1 5 According to a second aspect of the invention, a method for the production of an 

array of polynucleotides which are at a density that permits individual resolution, 
comprises arraying on the surface of a solid support, a mixture of relatively short 
molecules and relatively long polynucleotides, wherein the short molecules are arrayed in 
an amount in excess of the polynucleotides. 

20 The arrays of the present invention comprise what are effectively single analysable 

polynucleotides. This has many important benefits for the study of the polynucleotides and 
their interaction with other biological molecules. In particular, fluorescence events 
occurring on each polynucleotide can be detected using an optical microscope linked to 
a sensitive detector, resulting in a distinct signal for each polynucleotide. 

25 When used in a multi-step analysis of a population of single polynucleotides, the 

phasing problems that are encountered using high density (multi-molecule) arrays of the 
prior art, can be reduced or removed. Therefore, the arrays also permit a massively 
parallel approach to monitoring fluorescent or other events on the polynucleotides. Such 
massively parallel data acquisition makes the arrays extremely useful in a wide range of 

3 0 analysis procedures which involve the screening/characterising of heterogeneous mixtures 
of polynucleotides. 
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The preparation of the arrays requires only small amounts of polynucleotide sample 
and other reagents, and can be carried out by simple means. 
Description of the Drawings 

Figures la and b are images of a single polynucleotide array, where single 
5 polynucleotides are indicated by the detection of a fluorescent signal generated on the 
array. 

Description of the Invention 

The single polynucleotide array devices of the present invention are fabricated to 
include a "monolayer" of relatively short molecules that coat the surface of a solid support 

10 material and provide a flexible means to control the density of the single polynucleotides 
and optionally to prevent non-specific binding of reagents to the solid support. 

In the context of the present invention, the terms "relatively short" and "relatively 
long" should be interpreted to mean that the "relatively long" polynucleotides extend 
above the "relatively short" molecules when arrayed. 

1 5 The single polynucleotides immobilised onto the surface of a solid support should 

be capable of being resolved by optical means. This means that, within the resolvable area 
of the particular imaging device used, there must be one or more distinct images each 
representing one polynucleotide. Typically, the polynucleotides of the array are resolved 
using a single molecule fluorescence microscope equipped with a sensitive detector, e.g. 

20 a charge-coupled device (CCD). Each polynucleotide of the array may be imaged 
simultaneously or, by scanning the array, a fast sequential analysis can be performed. 

The polynucleotides of the array are typically DNA or RNA, although nucleic acid 
mimics, e.g. PNA or 2-O-Meth-RNA, are within the scope of the invention. The 
polynucleotides are formed on the array to allow interaction with other molecules. It is 

25 therefore important to immobilise the polynucleotides so that the portion of the 
polynucleotide not physically attached to solid support is capable of being interrogated. 
In some applications all the polynucleotides in the single array will be the same, and may 
be used to capture molecules that are largely distinct. In other applications, the 
polynucleotides on the array may all, or substantially all, be different, e.g. less than 50%, 

30 preferably less than 30% of the polynucleotides will be the same. 
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The term "single molecule" is used herein to distinguish from high density multi- 
molecule (polynucleotide) arrays in the prior art, which may comprise distinct clusters of 
many polynucleotides of the same type. 

The term "individually resolved" is used herein to indicate that, when visualised, 
5 it is possible to distinguish one polynucleotide on the array from its neighbouring 
polynucleotides. Visualisation may be effected by the use of reporter labels, e.g. 
fluorophores, the signal of which is individually resolved. There may be some 
polynucleotides present on the solid support that are not capable of being individually 
resolved, however, these can be discounted during imaging, provided that the majority of 
10 the polynucleotides can be resolved at the single molecule level. 

The term "interrogate" is used herein to refer to any interaction of the arrayed 
polynucleotide with any other molecule, e.g. with a polymerase or nucleoside triphosphate. 

The density of the arrays is not critical. However, the present invention can make 
use of a high density of single polynucleotides, and these are preferable. For example, 
15 arrays with a density of 10 6 -10 9 polynucleotides per cm 2 may be used. Preferably, the 
density is at least 10 7 /cm 2 and typically up to lOVcm 2 These high density arrays are in 
contrast to other arrays which may be described in the art as "high density" but which are 
not necessarily as high and/or which do not allow single molecule resolution. 

The shorter molecules will typically be present on the array at much higher density, 
20 to coat the remaining surface of the solid support. The shorter molecules may therefore 
be brought into contact with the solid support at an excess concentration. Preferably, the 
small molecules are at a density of from 10 8 to 1 0 14 molecules/cm 2 , more preferably greater 
than 10 12 molecules/cm 2 . 

Using the methods and apparatus of the present invention, it may be possible to 
25 image at least 10 s or 10 8 polynucleotides/cm 2 , preferably at least 10 7 polynucleotides/cm 2 . 
Fast sequential imaging may be achieved using a scanning apparatus; shifting and transfer 
between images may allow higher numbers of polynucleotides to be imaged. 

The extent of separation between the individual polynucleotides on the array will 
be determined, in part, by the particular technique used to resolve the individual 
30 polynucleotide. Apparatus used to image molecular arrays are known to those skilled in 
the art. For example, a confocal scanning microscope may be used to scan the surface of 
the array with a laser to image directly a fluorophore incorporated on the individual 
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polynucleotide by fluorescence. Alternatively, a sensitive 2-D detector, such as a charge- 
coupled device, can be used to provide a 2-D image representing the individual 

polynucleotides on the array. 

Resolving single polynucleotides on the array with a 2-D detector can be done if, 

5 at 100 x magnification, adjacent polynucleotides are separated by a distance of 
approximately at least 250nm, preferably at least 300nm and more preferably at least 
350nm. It will be appreciated that these distances are dependent on magnification, and 
that other values can be determined accordingly, by one of ordinary skill in the art. 

Other techniques such as scanning near-field optical microscopy (SNOM) are 

1 0 available which are capable of greater optical resolution, thereby permitting more dense 
arrays to be used. For example, using SNOM, adjacent polynucleotides may be separated 
by a distance of less than lOOnm, e.g. lOnm. For a description of scanning near-field 
optical microscopy, see Moyer et al, Laser Focus World (1993) 29(10). 

An additional technique that may be used is surface-specific total internal reflection 

1 5 fluorescence microscopy (TIRFM); see, for example, Vale et al, Nature, (1996) 380: 451- 
453). Using this technique, it is possible to achieve wide-field imaging (up to 100 um x 
100 um) with single molecule sensitivity. This may allow arrays of greater than 10 7 
resolvable polynucleotides per cm 2 to be used. 

Additionally, the techniques of scanning tunnelling microscopy (Binnig et al, 

20 Helvetica Physica Acta (1982) 55:726-735) and atomic force microscopy (Hansma eta!., 
Arm. Rev. Biophys. Biomol. Struct. (1 994) 23 : 1 1 5- 1 39) are suitable for imaging the arrays 
of the present invention. Other devices which do not rely on microscopy may also be 
used, provided that they are capable of imaging within discrete areas on a solid support. 
The devices according to the invention comprise immobilised polynucleotides and 

25 other irnmobilised molecules. The other molecules are relatively short compared to the 
polynucleotides and are used to control the density of the polynucleotides. They may also 
prevent non-specific attachment of reagents, e.g. nucleoside triphosphates, with the solid 
support, thereby reducing background interference. In one embodiment, the shorter 
molecules are also polynucleotides. However, many different molecules may be used, e.g. 

30 peptides, proteins, polymers and synthetic chemicals, as will be apparent to the skilled 
person. The preferred molecules are organic molecules that contain groups that can react 
with the surface of a solid support. 
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Preparation of the devices may be carried out by first preparing a mixture of the 
relatively long polynucleotides and of the relatively short molecules. Usually, the 
concentration of the latter will be in excess of that of the long polynucleotides. The 
mixture is then placed in contact with a suitably prepared solid support, to allow 
5 immobilisation to occur. 

Single polynucleotides maybe immobilised to the surface of a solid support by any 
known technique, provided that suitable conditions are used to ensure adequate 
separation. Density of the polynucleotide molecules may be controlled by dilution. The 
gaps between the polynucleotides can be filled in with short molecules (capping groups) 

1 0 that may be small organic molecules or may be polynucleotides of different composition. 
The formation of the array of individually resolvable "longer" polynucleotides permits 
interrogation of those polynucleotides that are different from the bulk of the molecules. 

Suitable solid supports are available commercially, and will be apparent to the 
skilled person. The supports maybe manufactured from materials such as glass, ceramics, 

1 5 silica and silicon. Supports with a gold surface may also be used. The supports usually 
comprise a flat (planar) surface, or at least a structure in which the polynucleotides to be 
interrogated are in the same plane. Any suitable size may be used. For example, the 
supports might be of the order of 1-10 cm in each direction. 

Immobilisation may be by specific covalent or non-covalent interactions. Covalent 

20 attachment is preferred. Immobilisation of a polynucleotide will be carried out at either 
the 5' or 3' position, so that the polynucleotide is attached to the solid support at one end 
only. However, the polynucleotide may be attached to the solid support at any position 
along its length, the attachment acting to tether the polynucleotide to the solid support; 
this is shown for the hairpin constructs, described below. The immobilised (relatively 

25 long) polynucleotide is then able to undergo interactions with other molecules or cognates 
at positions distant from the solid support. Immobilisation in this manner results in well 
separated long polynucleotides. The advantage of this is that it prevents interaction 
between neighbouring long polynucleotides on the array, which may hinder interrogation 
of the array. 

30 Suitable methods for forming the devices with relatively short molecules and 

relatively long polynucleotides will be apparent to the skilled person, based on 
conventional chemistries. The aim is to produce a highly dense layer of the relatively short 
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molecules, interspersed with the relatively large polynucleotides which are at a density that 

permits resolution of each single polynucleotide. 

A first step in the fabrication of the arrays will usually be to functionalise the 

surface of the solid support, making it suitable for attachment of the 
5 molecules/polynucleotides. For example, silanes are known functional groups that have 

been used to attach molecules to a solid support material, usually a glass slide. The 

relatively short molecules and relatively long polynucleotides can then be brought into 

contact with the functionalised solid support, at suitable concentrations and in either 

separate or combined samples, to form the arrays. 
10 In one preferred embodiment, the long polynucleotides and the short molecules 

each have the same reactive group that attaches to the solid support, or to an intermediary 

molecule. 

In an alternative embodiment, the support surface may be treated with different 
functional groups, one of which is to react specifically with the relatively short molecules, 

15 and the other with the relatively long polynucleotides. Controlling the concentration of 
each functional group provides a convenient way to control the densities of the 
molecules/polynucleotides. 

In a still further embodiment, the relatively short molecules are immobilised at high 
density onto the surface of the solid support. The molecules are capable of reacting with 

20 the polynucleotides (either directly orthrough an intermediate functional group) which can 
be brought into contact with the molecules at a suitable concentration to provide the 
required density. The polynucleotides are therefore immobilised on top of the monolayer 
of molecules. Those molecules that are not in contact with a polynucleotide may be 
reacted with a further molecule to block (or cap) the reactive site. This may be carried out 

25 before, during or after arraying the polynucleotides. The blocking (capping) group may 
itself be a relatively short polynucleotide. 

Alternatively, only a minor proportion of the short molecules that are arrayed at 
high density on the solid support comprise a group that reacts with the polynucleotides; 
the majority are non-reactive. For example, the short molecules can be mixed silanes, a 

30 minor proportion ofwhichare reactive with afunctional group on the polynucleotides, and 
the remaining silanes are unreactive and form the array of short molecules on the device. 
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Therefore, controlling the concentration of the minor proportion of short molecules also 
controls the density of the polynucleotides. 

In this embodiment, the short molecules may have been modified in solution prior 
to immobilisation on the array so that only a minor proportion contain a fiinctional group 

5 that is capable of undergoing covalent attachment to a complementary functional group 
on the polynucleotides. 

In a related embodiment, the short molecules are polynucleotides, and appropriate 
concentrations of both relatively long and relatively short polynucleotides are reacted with 
a functional group and then arrayed on the solid support, or to an intermediate molecule 

10 bound to the solid support. 

Suitable functional groups will be apparent to the skilled person. For example, 
suitable groups include: amines, acids, esters, activated acids, acid halides, alcohols, thiols, 
disulfides, olefins, dienes, halogenated electrophiles and phosphorothioates. It is preferred 
if the group contains a silane. 

15 - The relatively small molecules may be any molecule that can provide a barrier 

against non-specific binding to the solid support. 

Suitable small molecules may be selected based on the required properties of the 
surface and the existing functionality. 

In a preferred embodiment, the molecules are silanes of type I^SiX^ (where R 

20 is an inert moiety that is displayed on the surface of the solid support and X is a reactive 
leaving group of type CI or O-alkyl). The silanes include tetraethoxysilane, 
triethoxymethylsilane, diethoxydimethylsilaneorglycidoxypropyltriethoxysilane, although 
many other suitable examples will be apparent to the skilled person. 

In an embodiment of the invention, the short molecules act as surface blocks to 

25 prevent random polynucleotide association with the surface of the solid support. 
Molecules therefore require a group to react with the surface (which will preferably be the 
same functionality as used to attach the polynucleotide to the surface) and an inert group 
that will be defined by the properties required on the surface. In an embodiment, the 
surface is fonctionalised with an epoxide and the small molecule is glycine, although other 

30 compounds containing an amine group would suffice. 
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It is also preferred if the small molecule is hydrophilic and repels binding of anions. 
The molecule therefore may be acid, phosphate, sulfate, hydroxyl or polyol and may 
include polyethers such as PEG. 

In one embodiment, the relatively short molecules are polynucleotides. These may 

5 be prepared using any suitable technique, including synthetic techniques known in the art. 
It maybe preferable to use short polynucleotides that are immobilised to the solid support 
at one end and comprise, at the other end, a non-reactive group, e.g. a dideoxynucleotide 
incapable of incorporating further nucleotides. The short polynucleotide may also be a 
hairpin construct, provided that it does not interact with a polymerase. 

10 In one embodiment of the present invention, each relatively long polynucleotide 

of the array comprises a hairpin loop structure, one end of which comprises a target 
polynucleotide, the other end comprising a relatively short polynucleotide capable of 
acting as a primer in a polymerase reaction. This ensures that the primer is able to perform 
its priming function during a polymerase-based sequencing procedure, and is not removed 

1 5 during any washing step in the procedure. The target polynucleotide is capable of being 
interrogated. 

The term "hairpin loop structure" refers to a molecular stem and loop structure 
formed from the hybridisation of complementary polynucleotides that are covalently 
linked. The stem comprises the hybridised polynucleotides and the loop is the region that 

20 covalently links the two complementary polynucleotides. Anything from a 5 to 25 (or 
more) base pair double-stranded (duplex) region may be used to form the stem. In one 
embodiment, the structure may be formed from a single-stranded polynucleotide having 
complementary regions. The loop in this embodiment may be anything from 2 or more 
non-hybridised nucleotides. In a second embodiment, the structure is formed from two 

25 separate polynucleotides with complementary regions, the two polynucleotides being 
linked (and the loop being at least partially formed) by a linker moiety. The linker moiety 
forms a covalent attachment between the ends of the two polynucleotides. Linker moieties 
suitable for use in this embodiment will be apparent to the skilled person. For example, 
the linker moiety may be polyethylene glycol (PEG). 

30 If the short molecules are polynucleotides in a hairpin construct, it is possible to 

ligate the relatively long polynucleotides to a minor proportion of the hairpins either prior 
to or after arraying the hairpins on the solid support. 
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The arrays have many applications in methods which rely on the detection of 
biological or chemical interactions with polynucleotides. For example, the arrays may be 
used to determine the properties or identities of cognate molecules. Typically, interaction 
of biological or chemical molecules with the arrays are carried out in solution. 
5 In particular, the arrays may be used in conventional assays which rely on the 

detection of fluorescent labels to obtain information on the arrayed polynucleotides. The 
arrays are particularly suitable for use in multi-step assays where the loss of 
synchronisation in the steps was previously regarded as a limitation to. the use of arrays. 
The arrays may be used in conventional techniques for obtaining genetic sequence 

10 information. Many of these techniques rely on the stepwise identification of suitably 
labelled nucleotides, referred to inUS-A-5634413 as "single base" sequencing methods. 

In an embodiment of the invention, the sequence of a target polynucleotide is 
determined in a similar manner to that described in US-A-5634413, by detecting the 
incorporation of nucleotides into the nascent strand through the detection of a fluorescent 

15 label attached to the incorporated nucleotide. The target polynucleotide is primed with 
a suitable primer (or prepared as a hairpin construct which will contain the primer as part 
of the hairpin), and the nascent chain is extended in a stepwise manner by the polymerase 
reaction. Each of the different nucleotides (A, T, G and C) incorporates a unique 
fluorophore at the 3' position which acts as a blocking group to prevent uncontrolled 

20 polymerisation. The polymerase enzyme incorporates a nucleotide into the nascent chain 
complementary to the target, and the blocking group prevents further incorporation of 
nucleotides. The array surface is then cleared of unincorporated nucleotides and each 
incorporated nucleotide is "read" optically by a charge-coupled device using laser 
excitation and filters. The 3' -blocking group is then removed (deprotected), to expose 

25 the nascent chain for further nucleotide incorporation. 

Because the array consists of distinct optically resolvable polynucleotides, each 
target polynucleotide will generate a series of distinct signals as the fluorescent events are 
detected. Details of the full sequence are then determined. 

Other suitable sequencing procedures will be apparent to the skilled person. In 

30 particular, the sequencing method may rely on the degradation of the arrayed 
polynucleotides, the degradation products being characterised to determine the sequence. 
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An example of a suitable degradation technique is disclosed in WO- A- 95/20053, 
whereby bases on a polynucleotide are removed sequentially, a predetermined number at 
a time, through the use of labelled adaptors specific for the bases, and a defined 
exonuclease cleavage. 

5 A consequence of sequencing using non-destructive methods is that it is possible 

to form a spatially addressable array for further characterisation studies, and therefore non- 
destructive sequencing may be preferred. In this context, the term "spatially addressable" 
is used herein to describe how different molecules may be identified on the basis of their 
position on an array. 

10 Once sequenced, the spatially addressed arrays may be used in a variety of 

procedures which require the characterisation of individual molecules from heterogeneous 
populations. 

The following Examples illustrate the invention, with reference to the 
accompanying drawings. 
15 Example 1 

Glass slides were cleaned with decon 90 for 1 2 h at room temperature prior to use, 
rinsed with water, EtOH and dried. A solution of glycidoxypropyltrimethoxysilane (0.5 
mL) and mercaptopropyltrimethoxysilane (0.0005 mL) in acidified 95% EtOH (50 mL) 
was mixed for 5 min. The clean, dried slides were added to this mixture and left for 1 h 

20 at room temperature rinsed with EtOH, dried and cured for 1 h at 100° C. Maleimide 
modified DNA was prepared from a solution of amino-DNA (S^CyS- 
CtgCTgAAgCgTCggCAggT-heg-aminodT-heg^ACCTgCCgACgCT; SEQIDNO. 1)(10 
HM, 100 |iL) and N-[y-Maleimidobutryloxy]succinimide ester (GMBS); (Pierce) (1 mM) 
in DMF/diisopropylethylamine (DIPEA)/water (89/1/10) for 1 h at room temperature. 

25 The excess cross-linker was removed using a size exclusion cartridge (NAPS) and the 
eluted DNA freeze-dried in aliquots and freshly diluted prior to use. An aliquot of the 
maleimide-GMBS-DNA (100 nM) was placed on the thiol surface in 50 mM potassium 
phosphate/1 mM EDTA (pH 7.6) and left for 12 h at room temperature prior to washing 
with the same buffer. 

30 The slide was inverted so that the chamber coverslip contacted the objective lens 

of an inverted microscope (Nikon TE200) via an immersion oil interface. A 60° fused 
silica dispersion prism was optically coupled to the back of the slide through a thin film of 
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glycerol Laser light was directed at the prism such that at the glass/sample interface it 
subtended an angle of approximately 68° to the normal of the slide and subsequently 
underwent Total Internal Reflection (TIR). Fluorescence from the surface produced by 
excitation with the surface specific evanescent wave generated by TTR was collected by 

5 the objective lens of the microscope and imaged onto an intensified charged coupled 
device (ICCD) camera (Pentamax, Princeton Instruments). 

Images were recorded using a combination of a 532 Nd:YAG laser with a 
580DF30 emission filter (Omega optics), with an exposure of 500 ms and maximum 
camera gain and a laser power of 50 mW at the prism. 

1 0 The presence of glycidoxypropyltrimethoxysilane gave improved results (Fig. 1 a) 

compared to a control carried out in the absence of glyridoxypropyltrimethoxysilane. 
Example 2 

Slides were cleaned with decon 90 for 12 h prior to use and rinsed with water, 
EtOH and dried. A solution of tetraethoxysilane (0.7 mL) and N-(3- 
1 5 triethoxysilylpropyl)bromoacetamide (0.0007 mL) in acidified 95% EtOH (35 mL) was 
mixed for 5 min. The clean, dried slides were added to this mixture and left for 1 h at 
room temperature, rinsed withEtOH, dried and cured for 1 h at 100°C. Phosphorothioate 
modifiedDNA(5'-im-T^^ 

ACCgCAgCACgCTCgCCAgCg; SEQIDNO. 2) where s = phosphorothioate (100 pM, 
20 100 \iL) in sodium acetate (30 mM, pH 4.5) was added to the surface and left for 1 h at 
room temperature. The slide was washed with a buffer containing 50 mM Tris/1 mM 
EDTA. 

Imaging was performed as described in Example 1 and a good dispersion of single 
molecules was seen (Fig. lb). 
25 Example 3 

Slides were cleaned with decon 90 for 12 h prior to use and rinsed with water, 
EtOH and dried. A solution of glycidoxypropyltrimethoxysilane (0.5 mL) in acidified 95% 
EtOH was prepared and the cleaned slides placed in the solution for 1 h, rinsed with EtOH 
and dried. Amino modified DNA (S^CyS-CTgCTgAAgCgTCggCAggT-heg-aminodT- 
30 heg-ACCTgCCgACgCT; SEQ ID NO. 1) (1 |iM, 100 jiL) was placed on the surface and 
left for 12 h at room temperature. The slide was washed with a solution of 1 mM glycine 
at pH 9 for 1 h and flushed with 50 mM potassium phosphate/1 mM EDTA (pH 7.6). A 
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good dispersion of coupled single molecules was seen by TIR microscopy, as described 
in Example L 

The slide was then exposed to a mixture containing Cy5-dUTP (20 \iM) and T4 
exo-polymerase (250 riM) and Tris (40 mM), NaCl (10 mM), MgCl 2 (4 mM), DTT (2 
5 mM), potassium phosphate (1 mM), BSA (0.2 mgs/ml) 100 nL) at room temperature for 
10 min. and then flushed with Tris/EDTA buffer. 

Imaging was performed using a pumped dye laser at 630 nm with a 670DF40 
emission filter at 40 mW laser power using the TIR setup as described. A lower level of 
non-specific trisphosphate binding was seen in the case using glycine, than in a control not 
10 treated with glycine. 
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CLAIMS 

1. A device comprising a high density array of relatively short molecules and 
relatively long polynucleotides immobilised on the surface of a solid support, wherein the 
polynucleotides are at a density that permits individual resolution of those parts thereof 

5 that extend beyond the relatively short molecules. 

2. A device according to claim .1, wherein each polynucleotide is immobilised by 
covalent bonding to the surface. 

3 A device according to claim 1 , wherein the polynucleotides and the short molecules 
contain the same reactive group that attaches to the solid support. 
10 4. A device according to any preceding claim, wherein the polynucleotides are 
immobilised to the solid support via covalent attachment to an intermediate molecule and 
the short molecules are incapable of undergoing the same covalent attachment to the 
polynucleotides. 

5. A device according to claim 4, wherein the intermediate molecule and the short 
1 5 molecules are silane compounds. 

6. A device according to any preceding claim, wherein adjacent polynucleotides of 
the array are separated by a distance of at least lOnm. 

7. A device according to any preceding claim, wherein the polynucleotides are 
separated by a distance of at least lOOnm. 

20 8. A device according to any preceding claim, wherein the polynucleotides are 
separated by a distance of at least 250nm: 

9. A device according to any preceding claim, having a density of from 10 6 to 10 9 
polynucleotides per cm 2 . 

10. A device according to any preceding claim, wherein the density is from 1 0 7 to 1 0 8 

2 5 molecules per cm 2 . 

11. A device according to any preceding claim, wherein the relatively short molecules 

are polynucleotides. 

12. Use of a device according to any preceding claim, for monitoring an interaction 
with a single polynucleotide, comprising resolving an arrayed polynucleotide with an 

30 imaging device. 

13. A method for the production of an array of polynucleotides which are at a density 
that permits individual resolution, comprising 
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arraying on the surface of a solid support, a mixture of relatively long 
polynucleotides and relatively short molecules, wherein the short molecules are in excess 
of the polynucleotides. 

14. A method according to claim 13, wherein the polynucleotides and the short 
5 molecules each have the same reactive group that attaches to the solid support or to an 

intermediate molecule. 

15. A method according to claim 13 orclaim 14, wherein the polynucleotides and short 
molecules are brought into contact with the solid support in a single compositioa 

16. A method according to claim 1 3 or claim 14, wherein the short molecules and the 
10 polynucleotides are arrayed separately, with the short molecules being brought into 

contact with the solid support first. 

17. A method according to claim 1 6, wherein a minor proportion of the arrayed short 
molecules comprise a functional group that reacts covalently with a functional group on 
the polynucleotides to enable the polynucleotides to be arrayed. 

15 18. A method according to claim 16, wherein, prior to being arrayed, a minor 
proportion of short molecules are modified in solution to provide the functional group 
complementary to that on the polynucleotides. 

19. A method according to claim 1 6, wherein the short molecules contain a functional 
group that is capable of reacting covalently with a complementary group on the 

2 0 polynucleotides, and wherein the polynucleotides are brought into contact with the solid 

support at a concentration that permits only a minor proportion of the short molecules to 
undergo reaction with the polynucleotides. 

20. A method according to claim 1 9, wherein those molecules that are not reacted with 
a polynucleotide are reacted with a capping agent. 

25 21 . A method according to claim 20, wherein the capping agent is a relatively short 
polynucleotide. 

22. A method according to claim 13, wherein the relatively short molecules are 
polynucleotides, and both long and short polynucleotides are reacted with a functional 
group and then arrayed either directly onto the solid support or to an intermediate 

3 0 molecule bound to the solid support. 

23. Amethod according to claim 13, wherein the short molecules are polynucleotides 
in a hairpin construct, and the relatively long polynucleotides are ligated onto a minor 
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proportion of the short molecules either prior to or after attachment of the short molecules 
to the solid support. 
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Fig. la 
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SEQUENCE LISTING 

<110> Solexa Ltd. 

<120> The Preparation of Polynucleotide Arrays 

<130> REP07013WO 

<140> not yet known 
<141> 2002-01-30 

<150> 09/771708 
<151> 2001-01-30 

<160> 2 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> mis cofeature 
<222> (1) . . (34) 

<223> m = cytosine with a fluorescent Cy3 group 
attached. n = 

hexaethyleneglycol-aminodT-hexaethyleneglycol 

<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 1 

mtgctgaagc gtcggcaggt nacctgccga cgct 



<210> 2 
<211> 62 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> (1) . . (62) 

<223> n = thymine with a TMR group, s = thymine 
modified with phosphorothioate 



1 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 2 

naccgtcgac gtcgacgctg gcgagcgtgc tgcggtssss taccgcagca cgctcgccag 60 



2 



